Extensible, Scalable Monitoring for Clusters of Computers

Authors

  • Eric Anderson
  • David A. Patterson
Abstract

We describe the CARD (Cluster Administration using Relational Databases) system for monitoring large clusters of cooperating computers. CARD scales both in capacity and in visualization to at least 150 machines, and can in principle scale far beyond that. The architecture is easily extensible to monitor new cluster software and hardware. CARD detects and automatically recovers from common faults. CARD uses a Java applet as its primary interface, allowing users anywhere in the world to monitor the cluster through their browser.

Similar resources

XML Opportunities in Real Time Immersive Simulation & Visualization Based on Clusters of Commodity Computers

Real Time Immersive Simulation and Visualization applications have been powered traditionally by high-end graphics workstations or supercomputers. But recently, clusters of commodity computers (PCs, Macintoshes, low cost workstations) have become a practical alternative. The advantages of a commodity cluster include low cost, flexibility, performance scalability and use of to legacy systems. Th...

ACL2 for Parallel Systems Software: A Progress Report

A significant development in high-performance computing has occurred in recent years with the proliferation of “Beowulf” clusters [6]. Beowulf clusters are parallel computers assembled from commodity-priced personal computers and networks. The explosive growth of the personal computer marketplace, together with rapid technological advances in the hardware sold there, has driven the price/perfor...

Scalable Cluster Based Cloud Storage

We consider a cloud system that has to save lots of files and has to use hundreds of computers. The existing cloud storage designs are not scalable enough to support such a huge number of nodes. In this paper, we propose a novel cloud storage system containing thousands of virtual file servers on hundreds of computers. We group these virtual servers into clusters. This system is perfectly scala...

EXCLAIM framework: a monitoring and analysis framework to support self-governance in Cloud Application Platforms

The Platform-as-a-Service segment of Cloud Computing has been steadily growing over the past several years, with more and more software developers opting for cloud platforms as convenient ecosystems for developing, deploying, testing and maintaining their software. Such cloud platforms also play an important role in delivering an easily-accessible Internet of Services. They provide rich support...

Supporting On-line Distributed Monitoring and Debugging

Monitoring systems have traditionally been developed with rigid objectives and functionalities, and tied to specific languages, libraries and run-time environments. There is a need for more flexible monitoring systems which can be easily adapted to distinct requirements. On-line monitoring has been considered as increasingly important for observation and control of a distributed application. In...

Publication date: 1997